Figure 2 



Gene Sequence of sdd17, an Omega-3 Fatty Acid Desaturase Gene 
from Saprolegnia diclina (ATCC 56851) 



1 ATGACTGAGG ATAAGACGAA GGTCGAGTTC CCGACGCTCA CGGAGCTCAA

51 GCACTCGATC CCGAACGCGT GCTTTGAGTC GAACCTCGGC CTCTCGCTCT

101 ACTACACGGC CCGCGCGATC TTCAACGCGT CGGCCTCGGC GGCGCTGCTC

151 TACGCGGCGC GCTCGACGCC GTTCATTGCC GATAACGTTC TGCTCCACGC

201 GCTCGTTTGC GCCACCTACA TCTACGTGCA GGGCGTCATC TTCTGGGGCT

251 TCTTCACGGT CGGCCACGAC TGCGGCCACT CGGCCTTCTC GCGCTACCAC

301 AGCGTCAACT TTATCATCGG CTGCATCATG CACTCTGCGA TTTTGACGCC

351 GTTCGAGAGC TGGCGCGTGA CGCACCGCCA CCACCACAAG AACACGGGCA

401 ACATTGATAA GGACGAGATC TTTTACCCGC ACCGGTCGGT CAAGGACCTC

451 CAGGACGTGC GCCAATGGGT CTACACGCTC GGCGGTGCGT GGTTTGTCTA

501 CTTGAAGGTC GGGTATGCCC CGCGCACGAT GAGCCACTTT GACCCGTGGG

551 ACCCGCTCCT CCTTCGCCGC GCGTCGGCCG TCATCGTGTC GCTCGGCGTC

601 TGGGCCGCCT TCTTCGCCGC GTACGCGTAC CTCACATACT CGCTCGGCTT

651 TGCCGTCATG GGCCTCTACT ACTATGCGCC GCTCTTTGTC TTTGCTTCGT

701 TCCTCGTCAT TACGACCTTC TTGCACCACA ACGACGAAGC GACGCCGTGG

751 TACGGCGACT CGGAGTGGAC GTACGTCAAG GGCAACCTCT CGAGCGTCGA

801 CCGCTCGTAC GGCGCGTTCG TGGACAACCT GAGCCACCAC ATTGGCACGC

851 ACCAGGTCCA CCACTTGTTC CCGATCATTC CGCACTACAA GCTCAACGAA

901 GCCACCAAGC ACTTTGCGGC CGCGTACCCG CACCTCGTGC GCAGGAACGA

951 CGAGCCCATC ATCACGGCCT TCTTCAAGAC CGCGCACCTC TTTGTCAACT

1001 ACGGCGCTGT GCCCGAGACG GCGCAGATCT TCACGCTCAA AGAGTCGGCC

1051 GCGGCCGCCA AGGCCAAGTC GGACTAA







Figure 3 



Amino Acid Sequence of an Omega-3 Fatty Acid Desaturase (SDD17)
from Saprolegnia diclina (ATCC 56851)

1 MTEDKTKVEF PTLTELKHSI PNACFESNLG LSLYYTARAI FNASASAALL 

51 YAARSTPFIA DNVLLHALVC ATYIYVQGVI FWGFFTVGHD CGHSAFSRYH 

101 SVNFIIGCIM HSAILTPFES WRVTHRHHHK NTGNIDKDEI FYPHRSVKDL 

151 QDVRQWVYTL GGAWFVYLKV GYAPRTMSHF DPWDPLLLRR ASAVIVSLGV 

201 WAAFFAAYAY LTYSLGFAVM GLYYYAPLFV FASFLVITTF LHHNDEATPW 

251 YGDSEWTYVK GNLSSVDRSY GAFVDNLSHH IGTHQVHHLF PIIPHYKLNE 

301 ATKHFAAAYP HLVRRNDEPI ITAFFKTAHL FVNYGAVPET AQIFTLKESA 

351 AAAKAKSD* 
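
The amino acid sequence of Figure 3 can be checked against the gene sequence of Figure 2 by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and do not come from the source.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()      # drop the spaces between 10-base blocks
    usable = len(dna) - len(dna) % 3        # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 nucleotides of the sdd17 sequence (Figure 2, row 1).
print(translate("ATGACTGAGG ATAAGACGAA GGTCGAGTTC"))  # MTEDKTKVEF

# Last row of Figure 2 (positions 1051-1077), which starts in frame.
print(translate("GCGGCCGCCA AGGCCAAGTC GGACTAA"))     # AAAKAKSD*
```

Both outputs agree with the first block and the last row of Figure 3, which is how ambiguous characters in either figure can be resolved against the other.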



Figure 4 



Comparative analysis of S. diclina Delta 17-desaturase (SDD17.pep) 
& Synechocystis sp. Delta 15-desaturase (SYCDESB) 



Frame: 2 initn: 733 init1: 305 opt: 689 Z-score: 996.8 expect(): 1.5e-47 
40.9% identity in 269 aa overlap 
(76-336:204-471) 

[Pairwise alignment of SDD17.pep and SYCDESB; residue rows not recoverable from the source]



Figure 5 



Comparative analysis of S. diclina Delta 17-desaturase (SDD17.pep) 
& C. elegans Delta 17-desaturase (CELEFAT) 



Frame: 1 initn: 490 init1: 222 opt: 502 Z-score: 724.0 expect(): 2.3e-32 
31.6% identity in 310 aa overlap 
(2-303:49-347) 

[Pairwise alignment of SDD17.pep and CELEFAT; residue rows not recoverable from the source]



Figure 6 



Gene Sequence of sdd12, a Delta 12-Desaturase Gene 
from Saprolegnia diclina (ATCC 56851) 



1 ATGTGCAAAG GTCAAGCTCC TTCCAAGGCC GACGTGTTCC ACGCTGCGGG

51 GTACCGCCCG GTCGCCGGCA CGCCCGAGCC GCTGCCGCTG GAGCCCCCGA

101 CGATCACGCT CAAGGACCTG CGCGCGGCGA TCCCGGCCCA CTGCTTTGAG

151 CGCAGCGCTG CCACTAGCTT TTACCATTTG GCCAAGAACC TTGCGATCTG

201 CGCCGGCGTG TTCGCCGTTG GCCTCAAGCT CGCGGCTGCC GACTTGCCGC

251 TCGCGGCCAA GCTGGTCGCG TGGCCCATCT ACTGGTTCGT CCAGGGCACG

301 TACTTTACGG GCATCTGGGT CATTGCGCAC GAATGCGGCC ACCAGGCGTT

351 CTCGGCGTCC GAGATCCTCA ACGACACGGT CGGTATCATT CTTCACTCGC

401 TCCTCTTTGT GCCGTACCAC AGCTGGAAGA TCACGCACCG CCGCCACCAC

451 TCCAACACGG GCAGCTGCGA GAACGACGAG GTGTTTACGC CGACGCCGCG

501 GTCCGTCGTC GAGGCCAAGC ACGACCACTC GCTCCTCGAA GAGAGCCCGC

551 TCTACAACCT GTACGGCATC GTCATGATGC TTCTCGTGGG CTGGATGCCG

601 GGCTACCTCT TCTTCAACGC GACCGGCCCG ACCAAGTACG CTGGCCTCGC

651 CAAGTCGCAC TTCAACCCGT ACGCAGCCTT TTTCCTCCCA AAGGAGCGCC

701 TCAGCATCTG GTGGAGCGAC CTCTGCTTCC TCGCGGCCTT GTACGGCTTT

751 GGCTACGGCG TCTCGGTCTT CGGCCTCCTC GATGTCGCCC GCCACTACAT

801 CGTGCCGTAC CTCATTTGCA ACGCGTACCT CGTGCTCATC ACGTACCTCC

851 AGCACACGGA TACGTACGTG CCCCACTTCC GCGGCGACGA GTGGAACTGG

901 CTGCGCGGCG CGCTCTGCAC CGTCGACCGC TCGTTCGGCG CGTGGATCGA

951 CAGCGCGATC CACCACATTG CCGACACGCA CGTGACGCAC CACATTTTCT

1001 CCAAGACGCC CTTCTACCAC GCGATCGAGG CGACCGACGC CATCACGCCC

1051 CTCCTCGGCA AGTACTACCT CATCGACCCG ACGCCGATCC CGCTGGCGCT

1101 CTGGCGCTCG TTCACGCACT GCAAGTACGT CGAGGACGAC GGCAACGTTG

1151 TGTTTTACAA GCGCAAGCTC GAGGAAAAGT AA





Figure 7 

Amino Acid Sequence of a Delta 12-Desaturase (SDD12)
from Saprolegnia diclina (ATCC 56851)



1 MCKGQAPSKA DVFHAAGYRP VAGTPEPLPL EPPTITLKDL RAAIPAHCFE

51 RSAATSFYHL AKNLAICAGV FAVGLKLAAA DLPLAAKLVA WPIYWFVQGT

101 YFTGIWVIAH ECGHQAFSAS EILNDTVGII LHSLLFVPYH SWKITHRRHH

151 SNTGSCENDE VFTPTPRSVV EAKHDHSLLE ESPLYNLYGI VMMLLVGWMP

201 GYLFFNATGP TKYAGLAKSH FNPYAAFFLP KERLSIWWSD LCFLAALYGF

251 GYGVSVFGLL DVARHYIVPY LICNAYLVLI TYLQHTDTYV PHFRGDEWNW

301 LRGALCTVDR SFGAWIDSAI HHIADTHVTH HIFSKTPFYH AIEATDAITP

351 LLGKYYLIDP TPIPLALWRS FTHCKYVEDD GNVVFYKRKL EEK*



Figure 8 



Comparative analysis of S. diclina Delta 12-desaturase (SDD12.pep) 
& G. hirsutum Delta 12-desaturase (GHO6DES) 



Frame: 3 initn: 992 init1: 413 opt: 1086 Z-score: 1531.8 expect(): 2.3e-77 

45.6% identity in 379 aa overlap 
(9-380:14-384) 

[Pairwise alignment of SDD12.pep and the G. hirsutum Delta 12-desaturase; residue rows not recoverable from the source]



Figure 9 



Sequence ID: 

Sequence ID 1 

5'-ATC CGC GCC GCC ATC CCC AAG CAC TGC TGG GTC AAG-3'

Sequence ID 2

5'-GCC CTC TTC GTC CTC GGC CAY GAC TGC GGC CAY GGC TCG TTC TCG-3'

Sequence ID 3

5'-GAG RTG GTA RTG GGG GAT CTG GGG GAA GAR RTG RTG GRY GAC RTG-3'

Sequence ID 4

5'-CCC TAC CAY GGC TGG CGC ATC TCG CAY CGC ACC CAY CAY CAG AAC-3'

Sequence ID 5

5'-GTT CTG RTG RTG GGT CCG RTG CGA GAT GCG CCA GCC RTG GTA GGG-3'

Sequence ID 6

5'-GGC TCG CAC TTC SAC CCC KAC TCG GAC CTC TTC GTC-3'

Sequence ID 7

5'-GAC GAA GAG GTC CGA GTM GGG GTW GAA GTG CGA GCC-3'

Sequence ID 8

5'-GCG CTG GAK GGT GGT GAG GCC GCC GCG GAW GSA CGA CCA-3'

Sequence ID 9

5'-CTG GGG GAA GAG RTG RTG GAT GAC RTG GGT GCC GAT GTC RTG RTG-3'

Sequence ID 10

5'-GGT GGC CTC GAY GAG RTG GTA RTG GGG GAT CTK GGG GAA GAR RTG-3'

Sequence ID 11

5'-GAG RTG GTA RTG GGG GAT CTG GGG GAA GAR RTG RTG GRY GAC RTG-3'

Sequence ID 12

5'-TAG GCG TAG CTC ACG TAG TCG CTC G-3'

Sequence ID 13

5'-TTC TTG CAC CAC AAC GAC GAA GCG ACG-3'

Sequence ID 14

5'-GGA GTG GAC GTA CGT CAA GGG CAA C-3'

Sequence ID 15

5'-TCA AGG GCA ACC TCT CGA GCG TCG AC-3'

Sequence ID 16

5'-CCC AGT CAC GAC GTT GTA AAA CGA CGG CCA G-3'

Sequence ID 17

5'-AGC GGA TAA CAA TTT CAC ACA GGA AAC AGC-3'

Sequence ID 18

5'-GGT AAA AGA TCT CGT CCT TGT CGA TGT TGC-3'

Sequence ID 19

5'-GTC AAA GTG GCT CAT CGT GC-3'

Sequence ID 20

5'-CGA GCG AGT ACG TGA GGT ACG CGT AC-3'

Sequence ID 21

5'-TCA ACA GAA TTC ATG ACC GAG GAT AAG ACG AAG GTC GAG TTC CCG-3'

Sequence ID 22

5'-AAA AGA AAG CTT CGC TTC CTA GTC TTA GTC CGA CTT GGC CTT GGC-3'

Sequence ID 23

5'-TCA ACA AAG CTT ATG ACC GAG GAT AAG ACG AAG GTC GAG TTC CCG-3'

Sequence ID 24

5'-AAA AGA GAA TTC CGC TTC CTA GTC TTA GTC CGA CTT GGC CTT GGC-3'
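
Several of the primers above contain IUPAC ambiguity codes (R, Y, S, W, K, M), so each such entry stands for a pool of concrete oligonucleotides. The following is a minimal sketch of how that pool can be counted and enumerated; the helper names `degeneracy` and `expand` are illustrative and not part of the source, and only the two-base ambiguity codes that occur in this listing are included.

```python
from itertools import product

# IUPAC nucleotide codes: the four bases plus the two-base ambiguity codes
# that appear in the primers of this listing.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
}


def _clean(primer: str) -> str:
    """Remove the spaces between triplets and normalise to upper case."""
    return "".join(primer.split()).upper()


def degeneracy(primer: str) -> int:
    """Number of distinct concrete sequences a degenerate primer represents."""
    count = 1
    for base in _clean(primer):
        count *= len(IUPAC[base])   # KeyError on any unexpected character
    return count


def expand(primer: str) -> list[str]:
    """Enumerate every concrete sequence encoded by a degenerate primer."""
    choices = (IUPAC[base] for base in _clean(primer))
    return ["".join(combo) for combo in product(*choices)]


# Sequence ID 2, written without the 5'/3' end markers.
seq_id_2 = "GCC CTC TTC GTC CTC GGC CAY GAC TGC GGC CAY GGC TCG TTC TCG"
print(degeneracy(seq_id_2))      # 4, from the two Y (C or T) positions
for oligo in expand(seq_id_2):
    print(oligo)
```

Enumeration grows multiplicatively with each ambiguous position, so for heavily degenerate primers such as Sequence ID 3 it is usually enough to call `degeneracy` alone.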

Sequence ID 25 

1 ATGACTGAGG ATAAGACGAA GGTCGAGTTC CCGACGCTCA CGGAGCTCAA

51 GCACTCGATC CCGAACGCGT GCTTTGAGTC GAACCTCGGC CTCTCGCTCT

101 ACTACACGGC CCGCGCGATC TTCAACGCGT CGGCCTCGGC GGCGCTGCTC

151 TACGCGGCGC GCTCGACGCC GTTCATTGCC GATAACGTTC TGCTCCACGC

201 GCTCGTTTGC GCCACCTACA TCTACGTGCA GGGCGTCATC TTCTGGGGCT

251 TCTTCACGGT CGGCCACGAC TGCGGCCACT CGGCCTTCTC GCGCTACCAC

301 AGCGTCAACT TTATCATCGG CTGCATCATG CACTCTGCGA TTTTGACGCC

351 GTTCGAGAGC TGGCGCGTGA CGCACCGCCA CCACCACAAG AACACGGGCA

401 ACATTGATAA GGACGAGATC TTTTACCCGC ACCGGTCGGT CAAGGACCTC

451 CAGGACGTGC GCCAATGGGT CTACACGCTC GGCGGTGCGT GGTTTGTCTA

501 CTTGAAGGTC GGGTATGCCC CGCGCACGAT GAGCCACTTT GACCCGTGGG

551 ACCCGCTCCT CCTTCGCCGC GCGTCGGCCG TCATCGTGTC GCTCGGCGTC

601 TGGGCCGCCT TCTTCGCCGC GTACGCGTAC CTCACATACT CGCTCGGCTT

651 TGCCGTCATG GGCCTCTACT ACTATGCGCC GCTCTTTGTC TTTGCTTCGT

701 TCCTCGTCAT TACGACCTTC TTGCACCACA ACGACGAAGC GACGCCGTGG

751 TACGGCGACT CGGAGTGGAC GTACGTCAAG GGCAACCTCT CGAGCGTCGA

801 CCGCTCGTAC GGCGCGTTCG TGGACAACCT GAGCCACCAC ATTGGCACGC

851 ACCAGGTCCA CCACTTGTTC CCGATCATTC CGCACTACAA GCTCAACGAA

901 GCCACCAAGC ACTTTGCGGC CGCGTACCCG CACCTCGTGC GCAGGAACGA

951 CGAGCCCATC ATCACGGCCT TCTTCAAGAC CGCGCACCTC TTTGTCAACT

1001 ACGGCGCTGT GCCCGAGACG GCGCAGATCT TCACGCTCAA AGAGTCGGCC

1051 GCGGCCGCCA AGGCCAAGTC GGACTAA







Sequence ID 26 

1 MTEDKTKVEF PTLTELKHSI PNACFESNLG LSLYYTARAI FNASASAALL

51 YAARSTPFIA DNVLLHALVC ATYIYVQGVI FWGFFTVGHD CGHSAFSRYH

101 SVNFIIGCIM HSAILTPFES WRVTHRHHHK NTGNIDKDEI FYPHRSVKDL

151 QDVRQWVYTL GGAWFVYLKV GYAPRTMSHF DPWDPLLLRR ASAVIVSLGV

201 WAAFFAAYAY LTYSLGFAVM GLYYYAPLFV FASFLVITTF LHHNDEATPW

251 YGDSEWTYVK GNLSSVDRSY GAFVDNLSHH IGTHQVHHLF PIIPHYKLNE

301 ATKHFAAAYP HLVRRNDEPI ITAFFKTAHL FVNYGAVPET AQIFTLKESA

351 AAAKAKSD*












Sequence ID 27

1 ATGGCCCCGC AGACGGAGCT CCGCCAGCGC CACGCCGCCG TCGCCGAGAC

51 GCCGGTGGCC GGCAAGAAGG CCTTTACATG GCAGGAGGTC GCGCAGCACA

101 ACACGGCGGC CTCGGCCTGG ATCATTATCC GCGGCAAGGT CTACGACGTG

151 ACCGAGTGGG CCAACAAGCA CCCCGGCGGC CGCGAGATGG TGCTGCTGCA

201 CGCCGGTCGC GAGGCCACCG ACACGTTCGA CTCGTACCAC CCGTTCAGCG

251 ACAAGGCCGA GTCGATCTTG AACAAGTATG AGATTGGCAC GTTCACGGGC

301 CCGTCCGAGT TTCCGACCTT CAAGCCGGAC ACGGGCTTCT ACAAGGAGTG

351 CCGCAAGCGC GTTGGCGAGT ACTTCAAGAA GAACAACCTC CATCCGCAGG

401 ACGGCTTCCC GGGCCTCTGG CGCATGATGG TCGTGTTTGC GGTCGCCGGC

451 CTCGCCTTGT ACGGCATGCA CTTTTCGACT ATCTTTGCGC TGCAGCTCGC

501 GGCCGCGGCG CTCTTTGGCG TCTGCCAGGC GCTGCCGCTG CTCCACGTCA

551 TGCACGACTC GTCGCACGCG TCGTACACCA ACATGCCGTT CTTCCATTAC

601 GTCGTCGGCC GCTTTGCCAT GGACTGGTTT GCCGGCGGCT CGATGGTGTC

651 ATGGCTCAAC CAGCACGTCG TGGGCCACCA CATCTACACG AACGTCGCGG

701 GCTCGGACCC GGATCTTCCG GTCAACATGG ACGGCGACAT CCGCCGCATC

751 GTGAACCGCC AGGTGTTCCA GCCCATGTAC GCATTCCAGC ACATCTACCT

801 TCCGCCGCTC TATGGCGTGC TTGGCCTCAA GTTCCGCATC CAGGACTTCA

851 CCGACACGTT CGGCTCGCAC ACGAACGGCC CGATCCGCGT CAACCCGCAC

901 GCGCTCTCGA CGTGGATGGC CATGATCAGC TCCAAGTCGT TCTGGGCCTT

951 CTACCGCGTG TACCTTCCGC TTGCCGTGCT CCAGATGCCC ATCAAGACGT

1001 ACCTTGCGAT CTTCTTCCTC GCCGAGTTTG TCACGGGCTG GTACCTCGCG

1051 TTCAACTTCC AAGTAAGCCA TGTCTCGACC GAGTGCGGCT ACCCATGCGG

1101 CGACGAGGCC AAGATGGCGC TCCAGGACGA GTGGGCAGTC TCGCAGGTCA

1151 AGACGTCGGT CGACTACGCC CATGGCTCGT GGATGACGAC GTTCCTTGCC

1201 GGCGCGCTCA ACTACCAGGT CGTGCACCAC TTGTTCCCCA GCGTGTCGCA

1251 GTACCACTAC CCGGCGATCG CGCCCATCAT CGTCGACGTC TGCAAGGAGT

1301 ACAACATCAA GTACGCCATC TTGCCGGACT TTACGGCGGC GTTCGTTGCC

1351 CACTTGAAGC ACCTCCGCAA CATGGGCCAG CAGGGCATCG CCGCCACGAT

1401 CCACATGGGC TAA



Sequence ID 28

1 ATGGCAAACA GCAGCGTGTG GGATGATGTG GTGGGCCGCG TGGAGACCGG

51 CGTGGACCAG TGGATGGATG GCGCCAAGCC GTACGCACTC ACCGATGGGC

101 TCCCGATGAT GGACGTGTCC ACCATGCTGG CATTCGAGGT GGGATACATG

151 GCCATGCTGC TCTTCGGCAT CCCGATCATG AAGCAGATGG AGAAGCCTTT

201 TGAGCTCAAG ACCATCAAGC TCTTGCACAA CTTGTTTCTC TTCGGACTTT

251 CCTTGTACAT GTGCGTGGAG ACCATCCGCC AGGCTATCCT CGGAGGCTAC

301 AAAGTGTTTG GAAACGACAT GGAGAAGGGC AACGAGTCTC ATGCTCAGGG

351 CATGTCTCGC ATCGTGTACG TGTTCTGCGT GTCCAAGGCA TACGAGTTCT

401 TGGATACCGC CATCATGATC CTTTGCAAGA AGTTCAACCA GGTTTCCTTC

451 TTGCATGTGT ACCACCATGC CACCATTTTT GCCATCTGGT GGGCTATCGC

501 CAAGTACGCT CCAGGAGGTG ATGCGTACTT TTCAGTGATC CTCAACTCTT

551 TCGTGCACAC CGTCATGTAC GCATACTACT TCTTCTCCTC CCAAGGGTTC

601 GGGTTCGTGA AGCCAATCAA GCCGTACATC ACCACCCTTC AGATGACCCA

651 GTTCATGGCA ATGCTTGTGC AGTCCTTGTA CGACTACCTC TTCCCATGCG

701 ACTACCCACA GGCTCTTGTG CAGCTTCTTG GAGTGTACAT GATCACCTTG

751 CTTGCCCTCT TCGGCAACTT TTTTGTGCAG AGCTATCTTA AAAAGCCAAA

801 AAAGAGCAAG ACCAACTAA



Sequence ID 29 

1 MTVGFDETVT MDTVRNHNMP DDAWCAIHGT VYDITKFSKV HPGGDIIMLA

51 AGKEATILFE TYHIKGVPDA VLRKYKVGKL PQGKKGETSH MPTGLDSASY

101 YSWDSEFYRV LRERVAKKLA EPGLMQRARM ELWAKAIFLL AGFWGSLYAM

151 CVLDPHGGAM VAAVTLGVFA AFVGTCIQHD GSHGAFSKSR FMNKAAGWTL

201 DMIGASAMTW EMQHVLGHHP YTNLIEMENG LAKVKGADVD PKKVDQESDP

251 DVFSTYPMLR LHPWHRQRFY HKFQHLYAPL IFGFMTINKV ISQDVGWLR

301 KRLFQIDANC RYGSPWNVAR FWIMKLLTTL YMVALPMYMQ GPAQGLKLFF

351 MAHFTCGEVL ATMFIVNHII EGVSYASKDA VKGVMAPPRT VHGVTPMQVT

401 QKALSAAEST KSDADKTTMI PLNDWAAVQC QTSVNWAVGS WFWNHFSGGL

451 NHQIEHHCFP QNPHTVNVYI SGIVKETCEE YGVPYQAEIS LFSAYFKMLS

501 HLRTLGNEDL TAWST*

Sequence ID 30

5'-CCG SAG TTC ACS ATC AAG GAG ATC CGC GAS KSC ATC CCG GCC CAC TGC TTC-3'

Sequence ID 31

5'-GRS CTT CTT GAK GTG GWM SGT GGC CTC CTC GGC GTG GTA GWR CGG CAT-3'



Sequence ID 32

5'-CCS STC TAC TGG GCC TGC CAG GGT RTC GTC CTC ACS GGT GTC TGG-3'

Sequence ID 33

5'-CCS STC TAC TGG ATC RYS CAG GGT RTC GTC KGY ACS GGT GTC TGG-3'

Sequence ID 34

5'-GGC GTG GTA GTG CGG CAT SMM CGA GAA GAR GTG GTG GGC GAC GTG-3'

Sequence ID 35

5'-CAC GTA CCT CCA GCA CAC GGA CAC CTA CG-3'

Sequence ID 36

5'-GAT CGA CAG CGC GAT CCA CCA CAT TGC-3'

Sequence ID 37

5'-CAA ATG GTA AAA GCT AGT GGC AGC GCT GC-3'

Sequence ID 38

5'-AGT ACG TGC CCT GGA CGA ACC AGT AGA TG-3'

Sequence ID 39

5'-TCA ACA GAA TTC ATG TGC AAA GGT CAA GCT CCT TCC AAG GCC GAC GTG-3'

Sequence ID 40

5'-AAA AGA AAG CTT TTA CTT TTC CTC GAG CTT GCG CTT GTA AAA CAC AAC-3'

Sequence ID 41 

1 ATGTGCAAAG GTCAAGCTCC TTCCAAGGCC GACGTGTTCC ACGCTGCGGG

51 GTACCGCCCG GTCGCCGGCA CGCCCGAGCC GCTGCCGCTG GAGCCCCCGA

101 CGATCACGCT CAAGGACCTG CGCGCGGCGA TCCCGGCCCA CTGCTTTGAG

151 CGCAGCGCTG CCACTAGCTT TTACCATTTG GCCAAGAACC TTGCGATCTG

201 CGCCGGCGTG TTCGCCGTTG GCCTCAAGCT CGCGGCTGCC GACTTGCCGC

251 TCGCGGCCAA GCTGGTCGCG TGGCCCATCT ACTGGTTCGT CCAGGGCACG

301 TACTTTACGG GCATCTGGGT CATTGCGCAC GAATGCGGCC ACCAGGCGTT

351 CTCGGCGTCC GAGATCCTCA ACGACACGGT CGGTATCATT CTTCACTCGC

401 TCCTCTTTGT GCCGTACCAC AGCTGGAAGA TCACGCACCG CCGCCACCAC

451 TCCAACACGG GCAGCTGCGA GAACGACGAG GTGTTTACGC CGACGCCGCG

501 GTCCGTCGTC GAGGCCAAGC ACGACCACTC GCTCCTCGAA GAGAGCCCGC

551 TCTACAACCT GTACGGCATC GTCATGATGC TTCTCGTGGG CTGGATGCCG

601 GGCTACCTCT TCTTCAACGC GACCGGCCCG ACCAAGTACG CTGGCCTCGC

651 CAAGTCGCAC TTCAACCCGT ACGCAGCCTT TTTCCTCCCA AAGGAGCGCC

701 TCAGCATCTG GTGGAGCGAC CTCTGCTTCC TCGCGGCCTT GTACGGCTTT

751 GGCTACGGCG TCTCGGTCTT CGGCCTCCTC GATGTCGCCC GCCACTACAT

801 CGTGCCGTAC CTCATTTGCA ACGCGTACCT CGTGCTCATC ACGTACCTCC

851 AGCACACGGA TACGTACGTG CCCCACTTCC GCGGCGACGA GTGGAACTGG

901 CTGCGCGGCG CGCTCTGCAC CGTCGACCGC TCGTTCGGCG CGTGGATCGA

951 CAGCGCGATC CACCACATTG CCGACACGCA CGTGACGCAC CACATTTTCT

1001 CCAAGACGCC CTTCTACCAC GCGATCGAGG CGACCGACGC CATCACGCCC

1051 CTCCTCGGCA AGTACTACCT CATCGACCCG ACGCCGATCC CGCTGGCGCT

1101 CTGGCGCTCG TTCACGCACT GCAAGTACGT CGAGGACGAC GGCAACGTTG

1151 TGTTTTACAA GCGCAAGCTC GAGGAAAAGT AA

Sequence ID 42

1 MCKGQAPSKA DVFHAAGYRP VAGTPEPLPL EPPTITLKDL RAAIPAHCFE

51 RSAATSFYHL AKNLAICAGV FAVGLKLAAA DLPLAAKLVA WPIYWFVQGT

101 YFTGIWVIAH ECGHQAFSAS EILNDTVGII LHSLLFVPYH SWKITHRRHH

151 SNTGSCENDE VFTPTPRSVV EAKHDHSLLE ESPLYNLYGI VMMLLVGWMP

201 GYLFFNATGP TKYAGLAKSH FNPYAAFFLP KERLSIWWSD LCFLAALYGF

251 GYGVSVFGLL DVARHYIVPY LICNAYLVLI TYLQHTDTYV PHFRGDEWNW

301 LRGALCTVDR SFGAWIDSAI HHIADTHVTH HIFSKTPFYH AIEATDAITP

351 LLGKYYLIDP TPIPLALWRS FTHCKYVEDD GNVVFYKRKL EEK*



